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Abstract 

Interceptors  for  tactical  ballistic  missile  defense  typically  are  conceived  to  have 
midcourse  phases  that  make  corrections  to  the  original  interceptor  free-flight  path  based  on 
updated  threat  state  estimates  from  the  filter  associated  with  a  remote  sensor.  Some  concepts 
call  for  one  midcourse  correction,  while  others  call  for  more  frequent  corrections.  The  goal  of 
this  study  is  to  find  the  optimal  frequency  of  midcourse  corrections  from  the  point  of  view  of 
minimizing  the  terminal  error,  as  well  as  to  determine,  for  a  given  design,  the  optimal 
allocation  of  thrust  resources.  It  is  found  that  the  more  frequently  the  corrections  are  made, 
the  less  the  errors  are  that  are  handed  over  to  the  terminal  phase.  Furthermore,  even  when 
less  fuel  is  available  than  that  required  to  take  out  all  known  errors,  the  optimal  strategy  is  to 
make  corrections  as  soon  as  the  amount  of  correction  required  just  equals  the  amount  of 
divert  available  for  each  burn,  until  midcourse  divert  fuel  is  exhausted.  If  continuous  thrust  is 
available  with  a  throttlable  rocket,  then  the  optimal  strategy  is  almost  always  to  continuously 
make  corrections  to  keep  the  known  errors  at  zero,  until  fuel  is  exhausted  or  terminal  homing 
commences. 
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Introduction 


In  order  for  the  performance  of  an 
exoatmospheric  tactical  ballistic  missile  (TBM) 
interceptor  to  be  maximized,  the  amount  of  fuel 
available  for  midcourse  divert  must  be  balanced 
between  having  sufficient  fuel  to  take  out  all 
errors  and  minimizing  kill  vehicle  weight  for 
increased  interceptor  speed  and  reach. 
Furthermore,  for  a  given  amount  of  divert 
capability,  it  is  desirable  to  know  what  the 
optimal  strategy  is  for  scheduling  midcourse 
corrections.  Whether  there  should  be  one  large 
midcourse  correction  or  numerous  smaller 
corrections  needs  to  be  known.  Also,  for  a  given 
number  of  potential  burns,  the  optimal  strategy  as 
to  when  to  apply  these  corrections  needs  to  be 
determined. 

This  paper  addresses  these  issues  using 
optimal  control  theory  and  parameter 
optimization  techniques.  First,  it  is  assumed  for 
this  study  that  a  sensor  that  is  remote  from  the 
interceptor  is  observing  the  threat  at  regular 
intervals,  and  that  the  midcourse  corrections  of 
the  interceptor  are  based  on  the  remote  sensor's 
threat  state  estimate.  In  this  mode,  the  sensor's 
filter  will  typically  reach  a  steady  state  velocity 
error.  This  is  the  basis  for  a  one  dimensional 
motion  optimization  problem  that  captures  the 
essence  of  the  full  problem.  The  goal  of  the 
problem  is  to  find  the  optimal  thrust  strategy  that 
minimizes  the  terminal  error  (the  error  that  is 


handed  over  for  the  onboard  sensor/filter/divert 
system  to  take  out  in  terminal  homing,  by  some 
guidance  method  such  as  proportional 
navigation^. 

It  is  found  that  in  almost  all  cases  the 
optimal  strategy  is  to  make  either  continuous  (if 
possible),  or  else  as  frequent  as  possible,  small 
corrections  all  the  way  to  terminal  homing.  The 
difference  between  numerous  corrections  and 
one  or  a  small  number  of  corrections  can  be 
significant.  Not  only  does  this  make  successful 
intercept  more  probable  for  a  given  amount  of 
divert,  but  it  also  requires  significantly  less  divert 
capability  for  a  given  design  goal. 


Problem  Formulation 

It  is  assumed  that  the  remote  sensor's  filter 
has  reached  steady  state  during  the  time  period 
that  the  midcourse  corrections  are  to  be  applied. 
The  midcourse  corrections  are  based  on 
propagating  the  threat  state  estimate  forward 
from  the  current  time  to  the  time  of  intercept.  It 
is  also  assumed  that  the  intercept  time  does  not 
change  much  during  the  correction  process,  so 
for  this  study  the  intercept  time  is  considered 
fixed.  Given  these  things,  the  error  in  the 
propagation  of  the  threat  state  estimate  is 
approximated  well  by  a  linear  function  of  time. 
That  is,  if  Ov  represents  some  reasonable  upper 
bound  on  the  filter  velocity  error  (such  as  a  3- 
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Standard  deviation  value  under  a  Gaussian 
probability  density  function  assumption),  then  a 
very  good  estimate  of  an  upper  bound  for  the 
position  error  of  the  propagated  threat  at 
intercept  time  is  times  the  time-to-go  until 
intercept  (with  the  initial  position  error  being 
roughly  ignorable). 

Let  if  be  the  time  from  the  start  of  the 
midcourse  phase  until  the  beginning  of  terminal 
homing,  and  Ai/i  the  duration  of  terminal 
homing,  so  that 

T  —  tf  -j-  ^th 

is  the  intercept  time  relative  to  the  beginning  of 
midcourse.  If  x{t)  and  v{t)  represent  the  amount 
of  corrective  position  and  velocity  made  by  time 
t,  then  the  intercept  error  handed  over  to  the 
terminal  phase  is  approximated  by 

avT  -  {x{tf)  +  v{tf)  •  Ath)  . 

The  goal  of  the  optimization  process  is  to 
minimize  this  error. 

In  order  for  the  optimization  process  to 
make  sense,  however,  a  constraint  must  be  added 
to  the  optimization  problem  to  account  for  the 
fact  that  no  greater  correction  can  be  made  at  any 
given  moment  than  the  amount  of  increased 
knowledge  of  the  threat  state,  gained  over  the 
time  span  t.  Since  cr^  (r  —  t)  is  the  propagated 
error  at  time  t,  and  since  •  r  is  the  initial 
propagated  error,  the  difference  between  these, 
av-t  can  be  thought  of  as  the  amount  of  required 
correction  that  is  known  at  time  t.  Therefore,  the 
constraint  is  that  the  position  of  the  interceptor  at 
intercept  time,  if  the  interceptor  coasts  for  the 
remainder  of  the  time  to  go,  cannot  be  greater 
than  the  currently  known  correction  amount, 
av  •  t.  That  is, 

x{t)  v{t){T  -  t)  -(jyt<0.  (1) 

Two  different  formulations  will  be  used  for 
the  dynamics.  First,  a  throttlable  rocket  will  be 
assumed,  so  that 

Pue 

V  =  -  , 

m 

where  /3  €  [0,/3m]  is  the  mass  flow  rate,  Ue  is  the 
equivalent  exhaust  velocity  of  the  rocket,  and  m 
is  the  current  vehicle  mass.  This  leads  to  an 
optimal  control  problem,  with  P  being  the 
control  variable. 

Second,  a  series  of  instantaneous  impulse 
rocket  bums  will  be  assumed,  to  approximate 
burns  of  short  duration  separated  by  coasting 


times.  In  this  case,  the  optimization  process  is  to 
choose  the  optimal  instantaneous  velocity 
corrections  and  corresponding  times  of  the 
corrections  in  order  to  minimize  the  same  cost 
function,  subject  to  the  constraint  that  the  sum  of 
the  velocity  corrections  cannot  exceed  the  total 
allotted,  and  subject  to  Ineq.  (1)  written  in  terms 
of  the  sum  of  the  velocity  corrections  and  the 
time  increments. 

Continuous  Control 


The  differential  equations  of  the  state  are: 


X  ^  V  , 

(2) 

II 

(3) 

and 

m  =  -  P  , 

(4) 

with  initial  conditions 

o 

!l 

(5) 

^(0)  =  0  , 

(6) 

and 

m(0)  =  tUq  . 

(7) 

At  the  given  final  time,  f/ ,  x{tf)  and  v(i/)  are 
unspecified  (that  is,  they  are  free  to  take  on  any 
value  that  minimizes  the  cost  function),  but  the 
vehicle  final  mass 

^  .  (8) 

The  way  this  inequality  constraint  is  handled  in 
practice  is  to  assume  that  m{tf)  is  also 
unspecified,  and  if  the  inequality  is  violated  in 
the  solution,  then  the  solution  is  attempted  again 
with  the  equality  constraint  m{tf)  =  m^un  ♦ 

Then  the  problem  is 

iTiin  -f  'u(i/)*A4)|  (9) 

subject  to  the  control  variable  constraint 
/3e[0,^ni] 

and  the  state  constraint 

Q{t)  <  0  ,  (10) 

where 
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Q{t)  -  x{t)  +  v{t)  •  {r -t)  -  av -  t  .  {ll) 


,  (17) 


In  order  to  solve  this  optimal  control  problem, 
first  the  Hamiltonian  is  set  up  in  the  usual 
manner^, 

H  ~  \x  *  V  +  \v  *  — —  ~  »  (12) 

m 


and 

Xmitf)  -  0  (18) 

when  the  final  mass  is  not  specified,  or 

Xm{tf)  =  ly  (19) 


where  the  costate  variables  A^,  A^;,  and  Xm  satisfy 
the  differential  equations 


A.=  -f  =  0.  (13) 


and 


— 


dH  _  PUe 


(14) 


(15) 


The  endpoint  conditions  on  these  costate 
variables  are  determined  by  the  transversality 
conditions. 

Since  all  three  state  variables  are  specified  at 
the  beginning  time,  transversality  states  that  the 
costate  variables  at  this  time  are  free  (that  is,  they 
are  free  to  take  on  whatever  value  is  required  to 
solve  the  resultant  boundary  value  problem).  At 
the  final  time,  t/,  each  Xi  (for  i  G  {a:,  i;,  m}) 
must  satisfy 


d(t>  dip 


when  m{tf)  =  mmin  (that  is,  Xm{tf)  is  a  free 
parameter). 

On  points  of  the  trajectory  where  the  state 
constraint  Ineq.  (10)  is  not  active,  given  the  state 
and  costate  variables  on  an  optimal  trajectory, 
the  optimal  control  value  is  found  through  the 
Minimum  Principle,  which  states  that  the  optimal 
P  satisfies 

min  H  . 

P 


Since  Eq.  (12)  is  an  affine  function  in  /3,  the 
value  of  the  optimal  P  always  lies  at  one 
endpoint  of  the  interval  [0,/3m],  unless  the  slope 
is  zero.  Therefore, 


P^ 


,5>0 
,  5<0 


(20) 


where 

(21) 

m 

If  5  ^  0  for  some  finite  amount  of  time,  the 
control  is  termed  singular;  this  case  will  be 
discussed  in  the  last  subsection. 


where 

(p  =  Gvr  -  {x{tf)  +  v{tf)  •  A4) 

is  the  Mayer  cost  term,  ip  is  the  terminal 
condition  vector,  and  is  a  free  parameter  to  be 
determined  to  satisfy  the  resulting  conditions. 
Since  x{tf)  and  v{tf)  are  free  parameters,  ip  will 
not  contain  any  x  or  v  values  in  it.  For  the  case 
where  the  final  mass  is  free, 

^  _  0 

(that  is,  there  are  no  terminal  conditions  on  the 
state),  and  for  the  case  where  it  is  specified, 

Ip  =  m  —  rrif  . 

Therefore, 

Xx{tf)  -  -  1  ,  (16) 


golation  Prpc^gg 

The  process  used  to  obtain  solutions  to  this 
problem  is  a  two-step  method.  First,  a 
differential  inclusion  direct  method  is  used  to 
obtain  numerical  estimates  of  the  solutions,  with 
particular  emphasis  given  to  finding  the 
appropriate  switching  structure  for  each 
problem^  Typically,  a  multiple  shooting  method 
could  be  used  at  this  point  to  obtain  refined 
solutions  that  satisfy  the  two-point  boundary 
value  problem  of  the  six  dimensional  combined 
state  and  costate  equations,  with  state  boundary 
conditions  and  costate  transversality  conditions, 
and  incorporating  the  form  of  the  optimal  control 
in  Eq.  (20).  For  this  problem,  however,  the 
differential  equations  are  simple  enough  to  be 
integrated  analytically  for  most  solution  types,  so 
the  different  switching  structures  indicated  by  the 
numerical  solution  are  analytically  pieced 
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together  to  obtain  appropriate  solutions.  In  some 
cases  the  solutions  are  simple  enough,  and  the 
logic  from  the  optimal  control  formulation  is 
clear  enough,  that  the  solutions  can  be  concluded 
without  requiring  full  integration  of  the  two-point 
boundary  value  problem. 

No  Path  Constraint  Solution 


As  a  first  case  to  solve,  assume  that  there  is 
no  path  constraint,  Ineq.  (10),  or  else  that  cr^  is 
sufficiently  large  compared  to  /Jm  such  that  the 
constraint  can  never  be  reached.  The  solution  to 
this  problem  is  the  simplest  of  all  of  the  cases 
(and  it  is  the  most  intuitively  obvious),  yet  it 
illustrates  many  of  the  methods  and  ideas  needed 
for  the  more  complicated  cases  presented  later. 
In  this  way,  some  of  the  more  important  features 
of  the  different  types  of  solutions,  as  well  as  the 
different  solution  methods,  can  be  presented 
before  the  additional  complications  of  state 
constraints  and  singular  arcs  are  introduced. 

The  solution  to  the  differential  equation  for 
Xx  ,  Eq.  (13),  is  a  constant,  and  the  transversality 
condition,  Eq.  (16),  implies  that 

Xx{t)  -  -  1  (22) 

as  long  as  there  are  no  jumps  in  the  adjoint 
variables.  Since  jumps  in  the  adjoints  can  only 
occur  when  a  constraint  is  encountered,  Eq.  (22) 
is  the  correct  solution  in  this  case.  Similarly, 
straightforward  integration  of  Eq.  (14)  with  the 
application  of  the  terminal  boundary  condition, 
Eq.  (17),  results  in 

Xv{t)==t~T.  (23) 


The  solution  for  is  more  complicated 
since  the  differential  equation  Eq.  (15)  involves 
/3{t)  and  m{t),  so  the  switching  structure  for  P{t) 
must  be  known.  To  discern  the  correct  switching 
structure,  consider  the  time  derivative  of  the 
switching  function,  Eq.  (21),  which  is 


S  = 


Ue  ^  ,  'iJ'cXv  ^ 

Ax  H  ^  p 

m 


Or, 

S=-^X,.  (24) 

m 


Since  Ue  >  0,  m  >  0,  and  A®  <  0, 

S  >  0 .  (25) 

In  other  words,  S{t)  is  monotonically  increasing. 
A  typical  solution  of  S{t)  is  shown  in  Figure  1. 


Combining  the  monotonicity  of  S{t)  with  Eq. 
(20),  I3(t)  is  either  full  on  all  the  way  (so  long  as 
Ineq.  (8)  is  not  violated),  or  else  it  is  full  on  to 
full  off  at  the  time  that  S{t)  passes  through  zero. 


Figure  1.  A  Typical  Switching  Function  Solution. 


For  the  case  when  m{tf)  is  free,  Xm(tf)  =  0 
from  transversality  (Eq.  (18)).  Using  this  plus 
the  transversality  condition  on  A,;  (Eq.  (17))  in 
Eq.  (21)  at  the  final  time. 


Sit,)  = 


Ue  Ath 
m{tf) 


which  implies  that 

S{tf)  <  0  . 

Combined  with  the  monotonicity  of  S'(t),  this 
means  that  the  optimal  solution  for  the  final-mass 
free  case  is  to  thrust  at 

Pit)  =  Pm  V  te[o,tf]. 

For  the  case  when  m{tf)  is  specified,  Xmitf) 
is  free;  so  the  terminal  condition  on  the  mass  is 
what  determines  the  control.  Since  the  only 
possible  solution  is  full-on  to  full-off,  it  must  be 
that 

Pit)  —  pm 

until  some  switching  time,  say  ts  ,  at  which 
mits)  =  mnu„.  Or, 

TTIq  Pmts  ^^min  > 


SO  that 

_  rrip  - 

'  "  Pm  ' 

With  this  knowledge  of  the  switching 
Structure,  A;n(t)  can  be  solved  for  the  two  cases. 
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For  all  of  t  for  the  first  case,  and  for  t  <ts  for 
the  second  case,  the  differential  equation  for  Am 
is 

_  PmUejt  -  t) 

(mo  -  Pmty  ’ 


which  can  be  directly  integrated  to  yield 


Am(f)  = 


ln{mo  —  Pmt)  + 


imp  -  Prar)' 
{rrip  -  Pmt) . 


+  C. 


For  the  final  mass  free  case,  using  Eq.  (18)  to 
determine  the  constant  of  integration  yields 

f  {mo  -  PmT)Mt  -  tf)  ' 

’  ~  /3„  [  \mo  -  ^rpti)  ^  (mo  - 


For  the  final  mass  specified  case,  S{ts)  =  0 
determines  C.  S{ts)  —  0  implies  that 


Am(fs) 


Uejts  -  t) 

^min 


SO  that 


A™(t)  =  ^ 

ym 


In 


(yyip 


/  mp  -  (3mt  \  _ 

\77lp  —  jSmts )  '{mo  —  l3mt){mo  — 
Uejts  -  r) 


+  - 


^min 


for  t  <  ts,  and 


Xmit)  = 


Uejts  -  t) 
^min 


for  t>ts 

The  conclusion,  then,  is  that  when  there  is 
no  path  constraint  (which  is  only  of  academic 
interest),  or  when  the  path  constraint  can  never 
be  reached  for  a  given  /?ni  liniit  on  the  mass  flow 
rate  (which  signifies  that  that  sensor/filter  errors 
are  too  large  for  the  given  divert  capability),  the 
optimal  strategy  is  to  thrust  at  the  maximum  level 
until  either  the  start  of  terminal  homing  or  the 
fuel  available  is  exhausted.  The  former  case  is 
self  evident.  The  latter  case  results  because  the 
earlier  a  correction  is  made,  the  more  the 
terminal  error  that  can  be  taken  out,  because  the 
velocity  times  time-to-go  is  greater. 

Costate  Jumps  Upon  Hitting  the  State  Constraint 

For  all  cases  other  than  the  cases  where  there 
is  no  state  path  constraint  or  where  the  constraint 
cannot  be  reached,  the  possibility  of  being  on  a 
trajectory  that  hits  the  constraint  must  be 
considered.  In  general,  the  costates  and  the 
Hamiltonian  (and  hence  the  switching  function 


S(t) )  will  be  discontinuous  at  the  time  when  the 
trajectory  encounters  the  state  constraint"^.  The 
discontinuity  for  at  the  time  the  constraint  is 
entered  is 


_  .  dQ 

Ax  =  A^  +  M  , 

or 

+ 

II 

(26) 

Similarly, 

A„  =  a;;-  +  njr  - 1) 

(27) 

and 

\-  —  \  + 

(28) 

Also,  the  Hamiltonian,  Eq.  (12),  must  satisfy 

(29) 

Plugging  Eq.s  (26),  (27)  and  (28)  into  Eq.  (29), 
using  Eq.s  (12)  and  (11),  yields  the  necessary 
condition 


'P  ('^-*)„  ' 

Uq  ay 

— 

’A+  Ue  . 

m 

m 

where  A/?  is  the  jump  in  the  control  at  that  time. 
This  equation  can  be  satisfied  either  by 


A+Ue  -mXrn 
p-{T-t)ue  -  may 


Ap  ,  (30) 


or  else  by 


may 
Ue  (r  ”  t) 


(31) 


combined  with 

A/3-0 


or 

5+-0  . 

It  is  interesting  to  note  that  Eq.  (31)  is  the  same 
control  level  required  to  keep  the  trajectory  on 
the  constraint,  as  presented  in  the  next  section. 
This  means  that  Q  =  0  at  the  time  the  constraint 
is  encountered  if  the  control  is  continuous  across 
the  jump. 

Large  Mass  Flow  Rate  Solution 

Assume  now  that  the  mass  flow  rate  is 
arbitrarily  large,  so  that  /3in  ^  oo.  This  case  is  of 
interest  for  designing  a  system,  in  which  case  the 
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maximum  resulting  /Jm  will  guide  the  design 
thrust  level  And,  in  practice,  this  case  is  solved 
first,  and  if  a  /?m  limit  is  violated,  then  the 
problem  is  solved  again  with  new  switching 
structures  consistent  with  the  limit. 

The  numerical  solutions  for  large  mass  flow 
rate  cases  all  show  that  following  the  constraint 
from  the  start  until  either  time  tf  is  reached,  or 
else  until  mass  mmi„  is  reached,  is  the  optimal 
solution. 

When  the  constraint  is  encountered,  Q{t) 
from  Eq.  (11)  must  equal  zero.  Furthermore, 
Q{t)  must  also  be  zero  to  keep  the  trajectory  on 
the  constraint.  Setting  Q  to  zero  and  solving  for 
j3  yields  the  control  value  required  to  keep  the 
trajectory  on  the  constraint.  That  is, 

Q  =  -t)~av  ,  (32) 

m 


so  that. 


_  gym 

Ue{T  - 1)  * 


(33) 


Now,  m  =  —  so 


f  dm 

J  m 


/ 


gy  d't 

Ue{T  “  t) 


or, 


For  the  case  when  the  trajectory  starts  on  the 
constraint,  m(0)  =  mo,  so  that 

(T  \  “CTu/We 


To  solve  for  the  corresponding  v{t),  the 
differential  equation,  Eq.  (3),  with  the  right  side 
of  Eq.  (33)  substituted  in,  becomes 


which  has  the  solution  (with  initial  condition 
Eq.  (6)) 

v{t)  =  gy  • 

Integrating  this  to  obtain  position,  x{t),  with  the 
initial  condition  Eq.  (5),  and  using  Eq.  (34)  to 
simplify, 

x{t)  =  gyt  —  v{t)  ‘  (r  —  t)  .  (35) 

Consider  now  the  necessary  conditions  of 
optimal  control  for  this  case.  On  the  path 


constraint,  the  (j  =  0  constraint  must  be  adjoined 
to  the  Hamiltonian,  with  the  associated  Lagrange 
multiplier  a{t),  so  that 

H  =  X^.v  +  K-—  -  XmP 


Since  x  and  v  are  not  in  Q,  the  and 
differential  equations  remain  unchanged.  For  A^ 

Am  =  ^  [K  +  air  -  t)]  .  (36) 

a{t)  is  computed  by  taking  || ,  setting  it  to  zero, 
and  solving  for  a,  which  yields 

7n\fn  ligA-y 
^  Ue{T  -  t) 

Plugging  this  into  Eq.  (36)  yields 

_/3Am 

Am  —  , 

m 

which  has  the  solution 

Xm  =  C{T-t)-".  (37) 

Consider  now  the  case  where  the  trajectory 
starts  with  thrust  off,  switching  to  full  thrust  on 
until  the  constraint  is  hit.  When  the  final  mass  is 
free,  transversality  demands  that  Am(f/)  be  zero 
(Eq.  (18)).  Plugging  this  into  Eq.  (37)  yields 
(7  =  0,  so 

A„^(^)  -  0 


is  the  solution  when  the  constraint  is  ridden  to 
the  final  time. 

Next,  plugging  this  into  Eq.  (21)  at  the  time 
the  constraint  is  hit,  thu , 


S{ttu)  = 


Ue  ^vithit) 


and 


SitKu)  = 


Ue 

m{thit) 


=  —  (r-fAit)(M-l)- 
m 
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For  the  interceptor  to  have  been  thrusting  when  it 
hit  the  constraint, 

^  /X—  1<0, 
or 

jU  <  1. 

On  the  other  hand,  considering  Eq.  (21),  it  is 
required  that  5  <  0  for  a  bang-off  to  bang-on 
solution.  Examining  Eq.  (24),  this  in  turn 
necessitates  A"  >  0.  Also,  since  transversality 
requires  Aa:(^/)  =  —  1,  Eq.  (26)  requires 

fj>>  1  . 

Since  this  conflicts  with  the  previous  inequality 
on  fi,  this  switching  structure  can  never  happen. 
Therefore,  when  the  final  mass  is  free,  a  bang-off 
to  bang-on  to  constraint  solution  does  not  satisfy 
the  necessary  conditions  for  optimal  control. 
Riding  the  constraint  the  whole  way  is  the 
optimal  solution. 

If  the  final  mass  is  constrained,  the  optimal 
solution  is  to  start  on  the  constraint,  and  ride  it 
until  the  available  fuel  is  expended.  Showing 
that  this  satisfies  the  necessary  conditions  is  a 
straightforward  piecing  together  of  the  adjoint 
solutions  derived  above  (and  the  switching 
function  S)  for  the  constrained  arc  following  by 
the  coasting  arc.  It  is  also  straightforward  to 
show  that  if  the  trajectory  coasts  first,  then 
thrusts  at  the  maximum  until  the  constraint  is  hit, 
then  this  always  burns  more  fuel,  requiring  the 
thrust  to  be  terminated  earlier  and  the  terminal 
error  to  be  higher.  Therefore  riding  the 
constraint  from  the  start  until  the  fuel  is 
expended  is  the  optimal  solution. 

Thrust-Limited  Solutions 

Consider  now  the  case  when  the  (3^-^  oo 
solution  exceeds  the  true  /?ni  value.  For  example, 
a  typical  solution  for  the  optimal  thrust  level 
(where  the  thrust  T  —  (3  *Ue)  for  riding  the 
constraint  the  whole  way  is  shown  in  Figure  2. 
The  parameters  used  here  are  cr^;  =  1 50  m/s; 
tf  —  lOOs;  A4  — 10  s;  and  Ue  —  2455mfs. 
Suppose  that  the  maximum  thrust  is  set  to  1000 
N,  which  corresponds  to  /?m  =  0.407323  kg/s. 
Then  the  time  at  which  the  maximum  thrust  is 
reached  is  t=  75.036  s.  One  solution  that 
satisfies  the  necessary  conditions  for  optimal 
control  is  to  ride  the  constraint  until  the  time,  ts, 
that  the  value  of  /3  required  to  stay  on  the 
constraint  reaches  /?m  » then  switch  to  P{t)  =  /?m 


for  t>ts.  The  resulting  cost  function  value  for 
this  case  is  2793  m. 


Figure  2.  Thrust  for  the  Unlimited  Thrust  Solution. 

For  all  of  the  cases  examined  by  the  authors, 
however,  a  bang-off  to  bang-on  solution  always 
produces  slightly  lower  cost  (2770  m  for  the 
example  presented  here).  The  solution  starts 
with  thrust  off,  switching  to  P{t)  =  (3^  at  time  ^5. 
The  switch  time  ts  and  the  jump  value  /x  are 
iterated  upon  until  Eq.  (31)  is  satisfied  at  some 
time  thit ,  at  which  time  the  constraint  is  just 
touched  as  Q{t)  reaches  a  maximum.  Fig.  3 
gives  a  plot  of  Q{t)  for  the  same  example 
mentioned  above.  The  solution  parameters  are 
/X  =  1.0126,  and  ts  =  28.3787,  yielding 
thit  =  75.3712 . 


Figure  3.  State-Constraint  Function  for  the  Bang-OfT  to 
Bang-On  Solution. 

Once  the  mass  constraint,  Eq.  (8),  is 
encountered  in  a  solution,  however,  the  simple 
bang-bang  solution  is  no  longer  an  option, 
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because  the  switch  time  must  change  in  order  to 
bum  less  fuel,  in  which  case  Q{t)  can  never 
touch  zero,  so  that  the  costates  can  never  jump  to 
the  values  required  by  terminal  transversality. 
The  only  other  possibility  would  be  to  keep  the 
same  switching  time  and  terminate  the  thmst 
when  m{t)  =  mmin  .  This  cannot  be  optimal, 
though,  because  S{t)  must  reach  zero  at  the  time 
the  thrust  is  switched  off,  but  from  the  mass  free 
solution  S{tf)  <  0  . 

For  this  case  it  turns  out  that  the  optimal 
solution  is  to  ride  the  state  constraint  until  /3m  is 
encountered,  then  ride  the  Pm  constraint  until  the 
fuel  is  exhausted,  then  set  /3(f)  =  0  after  that. 
Take  the  example  presented  above.  The  final 
mass  for  the  mass-free  bang-bang  solution  is 
7n{tf)  -  220, S  kg,  with  mo  =  250A:^.  If  the 
required  final  mass  is  now  restricted  such  that  it 
can  be  no  less  than  m^in  =  225  kg ,  then  the 
solution  via  riding  the  constraint  has  a  cost  of 
3078  m.  By  contrast,  the  bang-bang  cost  is 
3465  m .  Also,  for  comparison  purposes,  if  the 
Pm  restriction  is  lifted,  yet  the  ~  225  kg 
restriction  is  maintained,  then  the  optimal 
solution  is  to  ride  the  constraint  until  fuel  is 
exhausted,  resulting  in  a  cost  of  2942  m. 

Singular  Control 

In  order  for  an  optimal  control  study  to  be 
complete,  the  possibility  of  singular  control  must 
be  investigated  to  see  if  the  global  optimum 
includes  any  singular  arcs^  For  singular  control, 
S{t)  must  be  identically  zero  for  some  finite 
amount  of  time,  so  S{t)  and  all  of  its  derivatives 
must  equal  zero  on  singular  arcs.  Typically, 
derivatives  of  S{i)  are  set  to  zero  to  determine 
conditions  necessary  to  have  singular  control;  at 
some  derivative  the  control  appears  (always  at  an 
even  derivative),  and  the  resulting  equation  is 
used  to  solve  for  the  value  of  the  control.  To  this 
end,  consider 


m 


Since  Ue  >  0  and  m  >  0,  it  is  required  that 
on  singular  arcs.  Now  the  second  derivative  is 

s= 

Here  the  control  has  appeared,  but  \x  =  0,  so 
that  any  control  will  satisfy  the  necessary 


conditions.  This  solution  trivially  satisfies  the 
generalized  Legendre-Clebsch  condition® 


(-1)  — 
^  ^  dp 


>0, 


k  —  1,2,... 


with  strict  equality  since  Xx  is  a  factor  in  all  the 
derivatives  of  S{t). 

Xx,  however,  must  be  equal  to  —  1  at  the 
terminal  time,  tf,  so  the  constraint  must  be 
encountered  in  order  for  it  to  jump  from  0  to 
—  1.  This  means  that  fj.  =  lin  Eq.s  (26)  -  (29). 
Therefore,  from  Eq.  (29)  with  jj,—  I, 

Ai;(f)  =  0  j  t  <C.  tfiit  J 


where  thu  is  the  time  the  constraint  is 
encountered.  This  in  turn  implies  that  (see 
Eq.  (21)) 

S{t)  =  Xfn{thit)  j  f  ^hit  7 

which  in  general  is  not  equal  to  zero  except  for 
the  case  where  thu  =  f/  and  final  mass  is  free. 

The  implication  of  this  is  that  if  the 
constraint  can  be  hit  at  the  final  time  by 
following  any  control  history,  it  is  an  optimal 
control.  This  is  obviously  true,  since,  as 
mentioned  previously,  hitting  the  constraint  at  the 
final  time,  if  it  is  possible  to  achieve  this,  is  the 
global  optimum.  So,  for  example,  if  Pm  is 
sufficiently  large  and  if  there  is  sufficient  fuel, 
then  /3(f)  could  be  such  that  the  trajectory  is  off 
of  any  of  the  extremals  mentioned  above,  as  long 
as  thrust  could  be  made  sufficiently  large  near 
the  end  for  the  constraint  to  be  hit  at  f/. 

This,  however,  is  only  of  theoretical  interest, 
since  it  yields  no  insight  into  the  structure  of 
general  optimal  trajectories;  also,  from  a  real- 
world  perspective,  if  ay  ,  for  example,  is  larger 
than  anticipated,  then  following  a  strategy  of 
waiting  for  as  long  as  possible  to  take  out  the 
errors  (when  it  is  estimated  that  extra  fuel  and 
thrust  are  available)  could  lead  to  mission  failure. 
All  indications  are  that  following  the  constraint 
the  whole  way  is  the  minimum  fuel  solution, 
which  would  leave  extra  fuel  for  handling 
unexpected  errors  near  the  end  of  midcourse. 


Discrete  Burn  Solutions 


Parameter  Optimization  Formulation 


For  the  n  discrete  burn  problem,  the 
minimization  problem  corresponding  to  Eq.  (9)  is 
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min  E 
AVi,  i  =  l,...n 

Ati ,  i  = 


where 


E^avT  ~  ^Avm  [  T  -  j  , 


m=l 


l=l 


subject  to 


m  m  /  i  \ 

9m  =  -  Y^Avi  I  r  -  |  >  0  , 

i=i  t=i  \  j=i  / 

m=  1, . . .  ,n 


g„+i  =  Aur  -  ^A-Wi  >  0  , 


i=l 


n 

9n+2  =  tf-  ®  ’ 

2=1 


and 


9n+2+m  —  AVm  ^  0  > 

m  =  1, ...  ,n 

where  Avj  is  the  total  amount  of  available 
velocity  correction  corresponding  to  burning 
until  the  divert  fuel  reaches  mmin ;  so 

AVt  —  Ue'ln 


Hereafter,  it  will  be  tacitly  assumed  that  the 
constraints  gn+2+m  ,  m  =  1, . . . ,  n  ,  are  satisfied. 

Taking  partial  derivatives  to  apply  the  usual 
necessary  conditions  for  parameter  optimization, 


BE 

dAvk 


=  -  r  + 


(=1 


dgn+i 

dAvk 


^9n+l  _  Q 

dAtk 


and 


^9n+2  _  Q 

dAvk 


dgn+2 

dAtk 


The  necessary  conditions  are 


dE 

dAvk 


n+2 


m—l 


dgm 

dAvk 


and 


dE 


72+2 


dg„ 


dAtk  dAtk 


=  0  . 


Upon  substitution  of  the  partials  above,  these 
become 


—  T  +  Y^Ati  +  ~  ^^Afj 


k 

Y. 

I— I  m—k  \  1=1 

-  A„+i  =  0  (38) 


and 


^Avi  +  [  cr^  + 


l=k  m=k 


l=k 


-  Xn+2  =  0  .  (39) 


Special  Cases:  n=l  and  n=2 


dE 

dAtk 


=  E^^f”  ’ 

m=k 


dgm  ^  f 
dAvk  \ 


-T  +  EAti 
1=1 
0 


,  k  <m 
,  k  >  m 

,  k  <m 
,  k>  m 


dgm 

dAtk 


For  n  =  1  and  n  =  2,  the  necessary 
conditions  Eq.s  (38)  and  (39)  can  be  solved 
directly  analytically.  For  n—  1,  if 


AVt  >  (7v 


h 

Ath 


(that  is,  if  there  is  more  than  sufficient  fuel),  then 
apply 


-  (Ty 


Atk 


at 
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Otherwise,  apply 


Ah  =  tf . 


Avi  =  Avt 


at 

Avt 

Ah  = - 1 — T  • 

(7y  +  AVt 

For  71  =  2,  the  fuel-limited  case  has  a  unique 
solution 


A-ui  =  Av2  ■ 


Avt 


with 


and 


A  J.  AVt 

Ah  ==  - 1 —  •  t 

2cr^  +  AVt 


^  2  AVt  (Tv 

At2  =  — - —T - TTT  •  r 


{2crv  +  AvtY 


-^n+l 


Then  using  this  numerical  value  in  Eq.  (38)  for 
the  rest  of  the  indices,  compute 

qt  =  ^  -  r  +  ~ 

/c:=n-l,n-2,  ...,1  (42) 

and  check  that 

qk  =  0,  /c  =  n- (43) 
Also,  it  must  be  checked  that 

Afc<0,  fc=l,2,...,n-f-l  .  (44) 


Taking  the  cue  from  the  n=2  solution,  a 
reasonable  guess  is  that 


Avi  = 


Avt 

n 


i  =  1, ...  ,n  . 


That  is,  the  solution  is  to  equally  divide  up 
all  the  velocity  corrections,  and  wait  to  do  each 
burn  until  the  amount  of  correction  estimated  by 
the  filter  just  equals  the  amount  of  divert  for  each 
bum. 

General  Fuel-Limited  Solution 


This  does  indeed  satisfy  Eq.s  (43)  and  (44).  The 
corresponding  Ati  are  obtained  from  Eq.  (41). 

So,  as  in  the  two  burn  case,  the  optimal 
solution  is  to  equally  divide  up  all  the  velocity 
corrections,  and  wait  to  do  each  burn  until  the 
amount  of  correction  estimated  by  the  filter  just 
equals  the  amount  of  divert  for  each  burn. 


It  turns  out  that  if  a  guess  for  the  A^;^  is 
given,  then  the  following  computations  can  be 
made.  First,  Xk  can  be  computed  by  solving 
Eq.  (39)  for  it,  yielding 

n  n  /  m 

l=k  77i=fc+l  \  l=k 

Gv  +  Avk 

A:  =  n,n  —  1, ,  1  .  (40) 

Atk  can  be  obtained  by  solving  =  0, 
771—  fork: 


More-than-Sufficient  Fuel  Results 

If  the  interceptor  has  more  than  sufficient 
fuel,  then  gn-\-2  —  0  and  Qn+i  >  0,  so  that 
An+2  <  0  and  An+i  =  0.  Eq.  (38)  then  becomes 
for  A:  =  n 

(-r4-t/)(An  +  l)  =  0  An=-1. 

With  k  =  n-l,Eq.  (38)  is 

(  —  T  +  t/  ~  Ain)  (An  +  An-1  +  1)  ==  0 

^  An-1  =  0  • 


{Jfc-1  /  k-1  \ 

-  +  At;*  f  T  -  1 


Successively  substituting  for  n  —  2, . . . ,  1  yields 
that 

An-i  =  An-2  =  • .  •  =  Ai  =  0  . 


jfc-i 


i=\ 


{av-\-Avk)  (41) 


A:  =  1,2, ...  ,n  . 
Then,  with  k  =  nm  Eq.  (38), 


Next,  substituting  these  Ai  into  Eq.  (39), 
beginning  with  k  =  7i, 

AVji  *4“  An(^i;  4”  A^n)  —  An4-2 


^  An+2  —  (^v  * 
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Then,  by  direct  inspection  with  this  value  of  An+2 
and  An+i  —  —  1  used  in  Eq.  (39)  with  A;  <  n,  it 
is  seen  that  the  equations  are  satisfied  for  all 
Avi . 

This  is  analogous  to  singular  control  in  the 
continuous  control  case.  Any  history  of 
corrective  bums  that  bring  the  trajectory  onto  the 
constraint  at  the  final  time  yields  an  optimal 
trajectory,  since  this  is  the  global  optimum. 

On  the  other  hand,  it  can  be  shown  that  of  all 
the  possible  solutions  for  this  singular  case,  the 
one  which  yields  the  minimum  fuel  expended  is 
to  ride  the  constraint  the  whole  way.  That  is,  if 
there  is  more  than  sufficient  fuel,  then  another 
optimization  problem  can  be  set  up,  with  the 
same  path  constraint,  plus  the  additional 
constraint  that  the  path  constraint  must  be  a  strict 
equality  constraint  at  time  tf.  Then  the  cost  to  be 
minimized  is  —  m(t/)  (orj^^^i)*  The  solution 

to  this  problem  is  to  keep  correcting  back  to  the 
constraint  for  all  burns.  The  minimum  fuel- 
expended  solution  is  significant  from  a  practical 
point  of  view,  because  it  leaves  room  for  further 
corrections  if,  for  example,  unexpectedly  large 
filter  errors  are  encountered. 

Implications  of  Results 

To  complete  the  picture  presented  above. 
Fig.  1  depicts  the  results  from  the  discrete  burn 
section  for  a  variety  of  numbers  of  burns,  for  the 
same  numbers  used  in  examples  above:  =  150 
m/s;  tf  =  100  s;  Ath  =  10  s;  Ue  =  2455  m/s; 
mo  =  250  kg;  and,  m^in  =  225  kg.  For 
comparison,  the  continuous  bum  results  yield  a 
value  of  2942  m.  Clearly,  the  discrete  bum 
results  are  approaching  the  continuous  result  as 
the  number  of  burns  approaches  infinity. 


Figure  4,  Terminal  Error  versus  Number  of  Bums 


All  of  the  results  above,  except  for  the  thrust 
limited  but  final-mass  free  case,  point  to  the  same 
basic  solution  structure.  This  structure  is  to  fully 
take  out  the  errors,  either  continuously,  or  else  as 
soon  as  the  correction  required  equals  the  amount 
of  divert  available  for  the  next  burn. 

Even  in  the  thrust-limited  but  mass-free  case, 
following  the  constraint  until  the  thrust  level 
reaches  the  maximum,  then  following  that 
maximum,  is  an  extremal.  It  also  typically  has 
only  about  1%  increase  in  cost  over  the  bang- 
bang  results.  Also,  once  the  mass  limit  of  the 
interceptor  is  reached,  the  bang-bang  solution 
rapidly  becomes  far  less  optimal.  Also,  on  a 
related  note,  even  when  the  mass  limit  is  not 
reached,  the  bang-bang  solution  burns 

significantly  more  fuel  than  the  constraint¬ 
following  solution. 

This  means  that  using  the  constraint¬ 

following  extremal  is  a  much  more  conservative 
approach,  being  better  equipped  to  absorb 

unexpected  errors  ft-om  various  sources.  If  the 
bang-bang  solution  happened  to  have  a  large 
performance  improvement  over  the  other 

extremal,  it  would  be  significant  to  incorporate  it. 
As  it  is,  the  marginal  improvement  it  provides, 
combined  with  the  higher  margin-of-error  risk  it 
entails,  probably  means  that  it  is  only  of 
theoretical  interest;  the  practical  result  appears  to 
be  to  use  the  constraint-following  extremal. 

As  final  confirmation  of  this,  it  was 
attempted  to  find  a  numerical  solution  to  the 
discrete  burn  problem  with  a  large  number  of 
burns  for  the  case  that  yielded  a  bang-bang 
solution,  but  the  answer  always  went  to  the 
constraint-following  solution.  This  apparently 
means  that  this  bang-bang  solution  disappears 
when  discrete  updates,  instead  of  continuous 
updates,  are  passed  from  the  remote  sensor  to  the 
interceptor.  In  conclusion,  even  for  the  one 
exceptional  case,  it  appears  that  considering  the 
problem  under  a  variety  of  performance 
objectives  points  to  following  the  constraint  until 
either  the  maximum  thrust  level  is  reached,  or 
else  the  fuel  is  exhausted,  is  the  preferred 
strategy. 

Returning  to  the  general  implications  of  the 
results,  from  Fig.  4  it  is  evident  that  making 
numerous  corrections  along  the  interceptor  path, 
as  opposed  to  making  only  one  or  two 
corrections,  significantly  improves  performance. 
For  the  case  presented,  about  100%  fuel  increase 
would  be  required  for  the  one  bum  interceptor  to 
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achieve  the  same  error  removal  level  as  25  to  50 
bums  (which  correspond  to  a  correction  every  2 
to  4  seconds).  This  kind  of  performance 
juxtaposition  is  typical  for  a  wide  range  of  values 
of  tf,  Ath,  etc. 

Conclusions 


Hemisphere,  Washington,  D.  C.,  1975,  pp.  246- 
270. 

^Kelley,  H.  J.,  Kopp,  R.  E.,  and  Moyer,  A.  G., 
“Singular  Extremals,”  Optimization  Techniques, 
Vol.  2,  edited  by  G.  Leitmann,  Academic  Press, 
New  York,  1967. 


It  has  been  shown  that  an  optimal  strategy  as 
to  when  to  make  midcourse  corrections  is  to 
continuously,  or  least  frequently,  take  out  the 
errors  in  the  interceptor  path,  as  the  filter 
associated  with  the  remote  sensor  makes 
improvements  in  the  estimate  of  the  threat  state 
propagated  to  the  intercept  point.  Also,  the 
performance  increase  in  having  numerous 
midcourse  corrections,  as  opposed  to  only  one  or 
two  corrections,  is  large.  For  a  fixed  interceptor 
mass  allocation,  this  means  that  the  probability  of 
successful  intercept  is  much  higher  if  numerous 
corrections  are  made.  For  affecting  the  design, 
this  means  that  much  less  fuel  would  be  required 
if  the  capability  for  numerous  corrections  was 
built  into  the  design,  so  that  the  interceptor 
payload  could  in  turn  be  lighter,  yielding  higher 
burnout  velocity  performance  for  the  booster. 
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